Wyner-Ziv Type Versus Noisy Network Coding 
For a State-Dependent MAC 
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Abstract — We consider a two-user state-dependent mul- 
tiaccess channel in which the states of the channel are 
known non-causally to one of the encoders and only strictly 
causally to the other encoder. Both encoders transmit a 
common message and, in addition, the encoder that knows 
the states non-causally transmits an individual message. 
We find explicit characterizations of the capacity region 
of this communication model in both discrete memoryless 
and memoryless Gaussian cases. The analysis also reveals 
optimal ways of exploiting the knowledge of the state only 
strictly causally at the encoder that sends only the common 
message when such a knowledge is beneficial. The encoders 
collaborate to convey to the decoder a lossy version of the 
state, in addition to transmitting the information messages 
through a generalized Gel'fand-Pinsker binning. Particularly 
important in this problem are the questions of 1) optimal 
ways of performing the state compression and 2) whether 
or not the compression indices should be decoded uniquely. 
We show that both compression a-la noisy network coding, 
i.e., with no binning, and compression using Wyner-Ziv bin- 
ning are optimal. The scheme that uses Wyner-Ziv binning 
shares elements with Cover and El Gamal original compress- 
and-forward, but differs from it mainly in that backward 
decoding is employed instead of forward decoding and the 
compression indices are not decoded uniquely. Finally, by 
exploring the properties of our outer bound, we show that, 
although not required in general, the compression indices 
can in fact be decoded uniquely essentially without altering 
the capacity region, but at the expense of larger alphabets 
sizes for the auxiliary random variables. 

I. Introduction 

Advances in the study of the effect of strictly causal states in 
multiuser channels are rather very recent and concern mainly 
multiple access scenarios. In [1], Lapidoth and Steinberg study 
a two-encoder multiple access channel with independent 
messages and states known causally at the encoders. They 
show that the strictly causal state sequence can be beneficial, 
in the sense that it increases the capacity for this model. This 
result is reminiscent of Dueck's proof [2] that feedback can 
increase the capacity region of some broadcast channels. In 
accordance with [2], the main idea of the achievability result 
in [1] is a block Markov coding scheme in which the two users 
collaborate to describe the state to the decoder by sending 
cooperatively a compressed version of it. As noticed in [1], 
although some non-zero rate that otherwise could be used to 
transmit pure information is spent in describing the state to 



the decoder, the net effect can be an increase in the capacity. 
In [3], they show that strictly causal state information is 
beneficial even if the charmel is controlled by two independent 
states each known to one encoder strictly causally. In this 
case, each encoder can help the other encoder transmit at a 
higher rate by sending a compressed version of its state to the 
decoder. In [4], Li, Simeone and Yener improve the results of 
[1], [3] and extend them to the case of multiple encoders. The 
achievability results in [4] are inspired by the noisy network 
coding scheme of [5] and, unlike [1], [3], do not use Wyner- 
Ziv binning [6] for the compression of the state. In a very 
recent contribution [7], Lapidoth and Steinberg derive a new 
irmer bound on the capacity region for the case of a single 
state governing the multiaccess channel. They also prove that 
the inner bound of [4] for the case of two independent states 
each known strictly causally to one encoder can indeed be 
strictly better than previous boixnds in [1], [3] - a result which 
is conjectured previously by Li, Simeone and Yener in [4]. 
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Fig. 1. State-dependent MAC with degraded message sets and states 
known noncausally at the encoder that sends both messages and only 
strictly causally at the other encoder. 

In this paper, which generalizes our former work [8], we 
study a two-user state-dependent multiple access charmel 
with the channel states known non-causally at one encoder 
and only strictly causally at the other encoder. The decoder 
is not aware of the channel states. As shown in Figure [H 
both encoders transmit a common message and, in addition, 
the encoder that knows the states non-causally transmits an 
individual message. More precisely, let Wc and Wi denote 
the common message and the individual message to be 
transmitted in, say, n uses of the channel; and S" = (Si, ... , S„) 
denote the state sequence affecting the channel during this 
time. At time i, Encoder 1 knows the complete sequence 



S" = (Si,...,S,-i,S„...,S„) and sends Xi, = </)i(W„ Wi, S"), 
and Encoder 2 knows only S'^^ = {Si, . . ., S,_i) and sends X2, - 
(p2,i(Wc,S'^^) - the functions (pi and (|)2,,- are some encoding 
functions. In this paper, we establish the capacity region of 
this state-dependent MAC model. As our analysis will show, 
this requires, among others, understanding the role of the 
strictly causal part of the state that is revealed to Encoder 2. 
We show that both compression of the state a-la noisy network 
coding, i.e., with no binning, and compression using Wyner- 
Ziv binning with or without non-unique decoding of the 
compresion indices are optimal. The question of whether non- 
unique decoding is beneficial in Wyner-Ziv type networks is 
important. Related very recent works can be found in [9], [10]. 
II. System Model and Definitions 

We consider a stationary memoryless state-dependent 
MAC Wy\Xi,X2,s whose output Y e y is controlled by the 
channel inputs Xi € Xi and X2 G X2 from the encoders 
and a channel state S € § drawn according to a memoryless 
probability law Qs- We assume that the channel state S" is 
known non-causally at Encoder 1, i.e., beforehand, at the 
beginning of the transmission block. Encoder 2 knows the 
channel states only strictly-causally; that is, at time ;', it knows 
the states only up to time i - 1, S'"^ - (Si, ... , S,_i). 

Both encoders transmit a common message and, in 
addition. Encoder 1 transmits also an individual message 
Wi. We assume that Wc and Wi are independent random 
variables drawn uniformly from the sets Wc = {!,••• , Mc] and 
Wi = {!,••• ,Mi j, respectively. The sequences X" and X'^ from 
the encoders are sent across a state-dependent multiaccess 
channel modeled as a memoryless conditional probability 
distribution Wy\Xi,X2,s- The joint probability mass function on 
WcXWiXS"xX'j'xX^xy" is given by 

PiiOc,zoi,s",x'l,xly") = P{iVc)Pizoi)Yl[Qsis,)P{xu\Wc,Wi,s") 

!=1 

■P{X2,,\Wc, s'''^)WY\Xi,X2.s{yi\Xl,i' ^2,i, Si)]. (1) 

The receiver guesses the pair (Wc, Wj) from the output Y". 

Definition 1: For positive integers n, Mc and Mi, an 
(Mc, Ml, n, e) code for the multiple access channel with states 
known noncausally at one encoder and only strictly causally 
at the other encoder consists of a mapping 

(pi : WcXWiXS" X'l (2) 

at Encoder 1, a sequence of mappings 

(p2,i : WcXS'-'^ -^X2, i = l,...,n (3) 

at Encoder 2, and a decoder map 

il^-.T ^ W,xWi (4) 

such that the average probability of error is bounded by e, 

= Es[Pt{iP{Y") + (Wc, Wi)|S" = s")] < e. (5) 

The rate of the common message and the rate of the individual 
message are defined as 

1 1 
R^^-logMc and Ri = -logMi, (6) 

respectively. 



A rate pair (Rc, Ki) is said to be achievable if for every e > 
there exists an (2"'^' ,2"^^,n,e) code for the channel Wy\Xi,X2,s- 
The capacity region of the considered state-dependent MAC 
is defined as the closure of the set of achievable rate pairs. 

Due to space limitation, the results of this paper are either 
outlined only or mentioned without proofs. Detailed proofs, 
converse proofs, the characterization of the capacity region in 
the Gaussian case as well as other results and discussions can 
be found in [11]. 

III. Discrete Memoryless Case 

In this section, it is assumed that S, Xi, X2 are finite. 

A. Capacity Region 

Let CP stand for the collection of all random variables 
(S,U,V,Xi,X2,Y) such that U, V, Xi and X2 take values in 
finite alphabets 11, V, Xi and X2, respectively, and 

Ps,u,v,x,,X2,Y(Sf u, V, xi, X2, y) 

= Ps,U.V,XiX2(s,U,V,Xi,X2)WY\Xi.X2,siy\^i'^2,s) (7a) 
PS,U,V,X,,X2(S, U, V, Xi,X2) 

= Qs(s)Px,(^2)Pl/|S,X,(l'|s,X2)f(J,Xi|S,y,X,(W/Jfl|s,W,X2) (7b) 

Ps,u.v,x,,X2{s,u,v,Xi,X2) = Qs(s). (7c) 

U,V,X\ ,X2 

The relations in imply that (U, V) «-> (S, Xi, X2) «^ Y is a 
Markov chain, and X2 is independent of S. 

Define 6 to be the set of all rate pairs (Rc, Ri) such that 

Ri < 1(U;Y\V,X2)-1{U;S\V,X2) 
Rc + Ri < I(U, V, X2; Y) - I(U, V, X2; S) 

for some (S, U, V, Xi, X2, Y) e 7. (8) 

The following proposition states some properties of C. 
Proposition 1: 

1. The set C is convex. 

2. To exhaust C, it is enough to restrict V and U to satisfy 

m<|S||li||X2l + l (9a) 

|U|<(|S||Xi||l2l + l)|S||Xi||X2|. (9b) 

As stated in the following theorem, the set C characterizes the 
capacity region of the state-dependent discrete memoryless 
MAC model that we study. 

Theorem 1: The capacity region of the multiple access 
channel with states known only strictly causally at the encoder 
that sends the common message and non-causally at the 
encoder that sends both messages is given by C. 

Remark 1: The proof of achievability of Theorem[T]is based 
on a block-Markov coding scheme in which a lossy version 
of the state is conveyed to the decoder, in the spirit of [1], [3], 
[7], in addition to a generalized Gel'fand-Pinsker binning for 
the transmission of the information messages [12]. However, 
unlike [1], [3] and [7] where Wyner-Ziv compression is 
utilized for the transmission of the lossy version of the state, 
here, inspired by the noisy network coding scheme of [5], at 
each block the compression index of the state of the previous 
block is sent using standard rate distortion, not Wyner- 
Ziv binning [6]. Also, unlike [1], [3] and [7] where every 



information message is divided into blocks and different 
submessages are sent over these blocks and then decoded 
one at a time using the same codebook as in the original 
compress-and-forward scheme by Cover and El Gamal [13], 
here the entire common message and the entire individual 
message are transmitted over all blocks using codebooks that 
are generated independently, one for each block, and the 
decoding is performed simultaneously using all blocks as 
in [5]. At the end of the transmission, the receiver uses the 
outputs of all blocks to perform simultaneous decoding of 
the information common and individual messages, without 
uniquely decoding the compression indices. □ 
Proof of Achievability: 

The transmission takes place in B blocks. The common 
message and the individual message Wi are sent over 
all blocks. We thus have Bw, = nBRc, Bw, = nBRi, N = nB, 
= By^/JN = Rc and Rvv, = B^JN = Ri, w^here Bw, is 
the number of common message bits, Bwj is the number of 
individual message bits, N is the number of channel uses 
and RvVr ^rid 7?w, are the overall rates of the common and 
individual messages, respectively. 

Codebook Generation: Fix a measure Ps.uxx^Xjj £ J'- Fix 

e > 0, 7], > 0, fji > 0, r) > 0, 6 > 1 and denote = 2"^^^'-'''^^], 

_ 2«BlRi-'7ie]^M = 2"t*'*"''''^l and / = 2"W-''^l^'''^2)+6e]^ 
We randomly and independently generate a codebook for 
each block. 

1) For each block i, i = 1, . . . , B, we generate McM inde- 
pendent and identically distributed (i.i.d.) codewords 
X2,!(Wc, t'.) indexed by ro^ = ■ - /Rcf = 1, • . • , M, each 
with i.i.d. components drawn according to Pxj- 

2) For each block i, for each codeword X2^i(Wc, t'.), we 
generate M i.i.d. codewords v,(Wc, t'., ti) indexed by f, - 

1. . . . , M, each with i.i.d. components drawn according to 
Rv\X2 ■ 

3) For each block for each codeword X2,,(it'c, f •), for each 
codeword v,(Wc, t'., f,), we generate a collection of /Mi 

1.1. d. codewords {\Xi{Wc,t'.,ti,Wi, ji)] indexed by Wi - 
1, . . . , Ml, ji = 1, . . . , /, each with i.i.d. components draw 
according to PL;|y,X2- 

Encoding: Suppose that a common message = Wc and 
an individual message Wi = Wi are to be transmitted. As we 
mentioned previously, and Wi will be sent over all blocks. 
We denote by s[i\ the state affecting the channel in block i, 
i = 1, . . . , B. For convenience, we let s[0] = and f _i = fo = 1 
(a default value). The encoding at the beginning of block i, 
i = 1, . . . , B, is as follows. 

Encoder 2, which has learned the state sequence s[i-l], knows 
t,_2 and looks for a compression index t,_i € [1 : M] such that 
v,-i(ifcA fj-iA fi-i) is strongly jointly typical with s[f - 1] and 
X2,i-i(roc, fi-2)- If there is no such index or the observed state 
s[; - 1] is not typical, f,_i is set to 1 and an error is declared. 
If there is more than one such index f,_i, choose the smallest. 
Encoder 2 then transmits the vector X2,,(Wc, t,_i). 
Encoder 1 obtains X2,,(iyc, ti-i) similarly. It then finds the 
smallest compression index € [1 : M] such that v,(Wc, t,_„ f,) 
is strongly jointly typical with s[i] and X2,,(iyc, f,-i). Again, if 
there is no such index or the observed state s[f] is not typical, 
ti is set to 1 and an error is declared. Next, Encoder 1 looks for 
the smallest /'/ such that Ui{Wc,ti-i,ti,iVi, ji) is jointly typical 



with sli] given (x2,i{Wc,ti-i),Vi{iVc,ti-i,ti)). Denote this ;', by 
i* = j{s[i],iVc,ti-i,ti,zvi). If such i* is not found, an error is 
declared and 7(s[z], Wc, t,-i, f,, zui) is set to = /. Encoder 1 then 
transmits a vector xi[;] which is drawn i.i.d. conditionally 
given u,{iUc,t,-i,t„zvi,j*), s[i\, Vi{zVc,ti.i,ti) and X2,,(iyc, f,-i) 
(using the conditional measure -Pxi|!j,s,v,X2 induced by l|7)). 

Decoding: At the end of the transmission, the decoder has 
collected all the blocks of channel outputs y[l], . . . , y[B]. 
Step (a): The decoder estimates message ifc using all blocks = 
1, . . . , B, i.e., simultaneous decoding. It declares that ibc is sent 
if there exist = {ti, . . . , fg) e [1 : Mf, lOt e [1 : MJ and = 
(71, ... , 7b) e [1 : Jf such that X2,,(z&c, t,_i), u,(Wc, f,-i, t„ zvi,j,), 
v/(z&c, t,_i, f,) and y[/] are jointly typical for all f = 1, . . . , B. One 
can show that the decoder obtains the correct zv^ as long as n 
and B are large and 

Rc+Ri< I{U, V, X2; Y) - I{U, V, X2; S). (10) 

Step (b): Next, the decoder estimates message zoi using again 
all blocks i = 1, . . . ,B, i.e., simultaneous decoding. It declares 
that 1^1 is sent if there exist t^ = (ti,. . .,{5) € [1 : M]^, j^ = 
{]!,■■ ■, 7b) e [1 : ]f such that X2,,(it;„ f,-i), u,(zOc, t,_i, t„ Wi, 7,), 
Vi{Wc, t,_i, ti) and y{i\ are jointly typical for all z = 1, . . . , B. One 
can show that the decoder obtains the correct lUi as long as n 
and B are large and 

Ri < I(U; Y\V, X2) -I{U;S\V, X2) (11a) 
Ri < I{U, V, X2; Y) - liU, V, X2; S). (lib) 

□ 

B. Wyner-Ziv Binning With Noniinique Decoding is Optimal 

In the coding scheme of Theorem[TJ the state compression is 
standard, i.e., uses no Wyner-Ziv binning, the same message 
is sent in every block, and the decoding of the sent message is 
performed jointly using all blocks. Although of no benefit 
in the case of one relay, the combination of these three 
features was shown to be essential in achieving rates that 
are strictly larger than those offered by schemes based on 
Cover and El Gamal classic compress-and-forward scheme 
[13] for certain networks with multiple relays in [5]. That is, 
the coding scheme of [5] outperforms Cover and El Gamal 
classic compress-and-forward for some multi-relay networks 
in [5]. One can wonder whether the same holds for our 
model, i.e., whether schemes based on Cover and El Gamal 
classic compress-and-forward, i.e., block Markov encoding 
combined with Wyner-Ziv binning, fall short of achieving 
optimality for our model. In this paper, we show that the 
capacity region 6 as given by (5) can be achieved alternatively 
with a coding scheme that we obtain by building upon 
and modifying Cover and El Gamal original compress-and- 
forward scheme. The modification consists essentially in 1) 
decoding block-by-block backwardly instead of block-by- 
block forwardly and 2) non-unique decoding of the com- 
pression indices. (In fact, by investigating more closely the 
converse proof of Theorem [ij we will show later that 2) can 
be relaxed essentially without altering the capacity region). 
The following theorem states the result. 

Theorem 2: For the state-dependent multiaccess channel 
model that we study, there exists an optimal coding scheme 
that uses Wyner-Ziv binning for the state compression. That 



is, the capacity region C given by {8) can also be achieved using 
a coding scheme in which the state compression is performed 
using Wyner-Ziv binning. 

Proof: The achievability proof of Theorem |2] is based 
on a block-Markov coding scheme that combines carefully 
Gel'fand-Pinsker binning and Wyner-Ziv binning, and uti- 
lizes backward decoding with non-unique decoding of the 
compression indices. 

The transmission takes place in B blocks. The common 
message is divided into B blocks zVc^i, . . . ,Wc,b of nRc 
bits each, and the individual messages Wi are divided into 
B blocks Wi^i, . . . ,zui^B of nRi bits each. For convenience, 
we let iVc^B = M'l.B = 1 (a default value). We thus have 
Bw, = m(b'- Bw, = n{B - l)Ri, N = nB, Rw,- = B„JN = 
RciB - 1)/B and Rw, = BwJN = RyiB - 1)/B, where Bw, 
is the number of common message bits, Byy, is the number 
of individual message bits, N is the number of channel uses 
and RvVr ^rid are the overall rates of the common and 
individual messages, respectively. For fixed n, the average 
rate pair (Rw^,Rv/J over B blocks can be made as close to 
(Rc, Ri) as desired by making B large. 

Codebook Generation: Fix a measure Ps.uxXuXjj £ 7'- Fix e > 
and denote M, = 2"[^'-''''^], Mi = 2"t^i-'"'^], Mo = T^'^'>^v\ 

1) We generate McMq independent and identically dis- 
tributed (i.i.d.) codewords X2{iOc,s) indexed by Wc — 
l,...,Rc, s = 1, ...,Mo, each with i.i.d. components 
drawn according to ■ 

2) For each codeword X2{Wc, s), we generate M independent 
and identically distributed (i.i.d.) codewords 'v{iVc,s,z) 
indexed by z = 1, ...,M, each with i.i.d. components 
drawn according to Pv\X2 ■ 

3) For each codeword X2(Wc, s), for each codeword v(iUc, s, z), 
we generate a collection of /Mi i.i.d. codewords 
{u(M;c,s,z,Wi,/)j indexed by Wi = 1, ...,Mi, ; = 1, ...,/, 
each with i.i.d. components draw according to PuKXj- 

4) Randomly partition the set {!,..., M) into Mo cells Cs, 

SG[l,Mo]. 

Encoding: Suppose that a common message = iVc 
and an individual message Wi = Wi are to be transmitted. 
As we mentioned previously, message Wc is divided into B 
blocks ■ • -/Wc^B and message Wi is divided into B blocks 
Wi^i, . . . , Wi^B, with {Wc^i, Wi^i) the pair messages sent in block 
i. We denote by s[z] the channel state in block i, i - 1, . . . , B. 
For convenience, we let s[0] = (p and Zq = 1 (a default value), 
and So the index of the cell containing Zo, i.e., Zo e Cs„ . The 
encoding at the beginning of the block i, i = 1, . . . , B, is as 
follows. 

Encoder 2, which has learned the state sequence s[!-l], knows 
s,_2 and looks for a compression index z,_i e [1,M] such that 
v(Wc,;-1aS,-2, z,_i) is strongly jointly typical with s[i — 1] and 
X2(Wc,,_i,s,_2)- If there is no such index or the observed state 
sli — 1] is not typical, z,_i is set to 1 and an error is declared. 
If there is more than one such index z,_i, choose the smallest. 
One can show that the probability of error of this event is 
arbitrarily small provided that n is large and 

R>I{V;S\X2). (12) 



Encoder 2 then transmits the vector X2(ryc,„ s,_i), where s,_i 
is such that z,_i € Cs,_, . 

Encoder 1 obtains X2(ifc,!, s,_i) similarly. It then finds the 
smallest compression index z, € [1,M] such that v{iOcj, s,_i, z,) 
is strongly jointly typical with sli] and X2(ifc,!, s,_i). Again, 
if there is no such index or the observed state s[f] is not 
typical, z, is set to 1 and an error is declared. Let s, e 
[l,Mo] such that Z; G Cg,. Next, Encoder 1 looks for the 
smallest such that u(iyc^„s,_i,z,-, z(;i„;,) is jointly typical 
with s[f], X2(Wc,„s,_i) and v(i(;c_,,s,_i,z,). Denote this /, by 
;* = j{s[i],Wi;j,Si^i,Zi,iUij). If such /* is not found, an 
error is declared and 7(s[z], ifc,„s,_i,z„Wi,,) is set to /, = /. 
Encoder 1 then transmits a vector xili] which is drawn i.i.d. 
conditionally given s[i], u(n;c^„s,_i,z;, Wi^;, /*), v{Wc,i,Si^i,Zi) 
and X2{iOcj,Si-i) (using the conditional measure Pxi\s,u,v,X2 
induced by Ps,u,y,Xi,X2,Y e 7). 

1) Decoding in Block B - 1: 

The decoding of the pair {w^^b-i, h^i,b-i) is performed in four 
steps, as follows. 

Step (a): The decoder knows Wc,b = 1 and looks for the unique 
cell index Sb-i such that the vector X2{zVc^b,Sb-i) is jointly 
typical with y[B]. The decoding operation in this step incurs 
small probability of error as long as n is sufficiently large and 

Ro<I{X2;y). (13) 

Step (b): The decoder now knows sb-i (i.e., the in- 
dex of the cell in which the compression index Zb-i 
lies). It then decodes message ifcB-i by looking for the 
unique z&c,b-i such that X2(ii'c,B-i,SB-2), 'v{zVc_b-i,Sb-2,Zb-i), 
u(Wc,B-i,SB-2,ZB-i,zi'i,B-i,7B-i) and y[B - 1] are jointly typical 
for some Sb_2 g [1,Mo], zv^b-i 6 [l,Mi], /b-i g [1,/] and 
Zb_i € Cs-g_j . One can show that the decoder obtains the correct 
ifc B-i as long as n and B are large and 

Ro + {R- Ro) + Rc + Ri< I{U, V, Xr, Y) - I{U; S\V, X2). (14) 

Step (c): The decoder knows ly^B-i and can again obtain the 
correct Sb_2 if w is large and l |13t is true. This is accomplished by 
looking for the unique 85-2 such that the vector X2(ti'c,B-i, SB-2) 
is jointly typical with y[B - 1]. 

Step (d): Finally, the decoder, which now knows message 
zVc^B-i and the cell index Sb_2 (but not the exact compression in- 
dex Zg-i), estimates Wi^b-i using y[B - 1]. It declares that z&i^b-i 
was sent if there exists a unique z&i^b-i such that X2 (zt'c,B-i, SB-2), 
v{zv,,B-i, Sb-2, Zb_i), u(Wc,b-i, Sb-2, ^-1' ^\b-i, Jb-i) and y[B - 1] 
are jointly typical for some Zg J G Gss_i and /'b_i G [1,/]. 

• If Zg J = Zb-1, the decoder finds the correct zfi^b-i for 
sufficiently large n if 

Ri < I{U; Y\V, X2) - I{U; S\V, X2). (15) 

• If Zg J ?i Zb_i, the decoder finds the correct ifi,;,-i for 
sufficiently large n if 

{R - Ro) +Ri< I{U, V; Y\X2) - I(!J; S\V, X2). (16) 

2) Decoding in Block b, b = B - 1,B - 2, . . . ,2: 

Next, for b ranging from B - 1 to 2, the decoding of the pair 
(iff ;,_i, zvi ij^i) is performed similarly, in five steps, by using 
the information y[fo] received in block b and the information 
ylb — 1] received in block b — 1. More specifically, this is done 
as follows. 



Step (a): The decoder knows lu^ i, and looks for the unique 
cell index S;,_i such that the vector X2(ti'c,b, S;,-i) is jointly 
typical with y[&]. The decoding error in this step is small 
for sufficiently large n if l ll3t is true. 

Step (b): The decoder knows S(,_i and decodes message iVc^-i 
from y[b] . It looks for the unique iVc^h-i such that X2 {iVc^b-ii S6-2)/ 
v{Wc,h-i,Sb-2,Zh-i), n{Wc,b-i,Sb-2,Zb-i,Wi_t-i,jb-i) and y[b - 1] 
are jointly typical for some Sb-2 € [l,Mo], zoi b-i G [l,Mi], 
jb-i € [1, /] and Zb^i € Cs,,_, . One can show that the decoding 
error in this step is small for sufficiently large n if l[T4) l is true. 
Step (c): The decoder knows z&c,6-i and obtains s;,_2 by looking 
for the unique 4-2 such that the vector X2(i&c,fc-i/ S(,_2) is jointly 
typical with y[b — 1]. For sufficiently large n, the decoder 
obtains the correct Sj„2 with high probability if | [T3t is true. 
Step (d): Finally, the decoder, which now knows mes- 
sage Wf i,-! and the cell index S{,_2 (but not the exact 
compression index 2;,_i), estimates message ivi^b-i using 
y[b — 1]. It declares that Wi,;,-! was sent if there exists 
a unique Wi_b-i such that X2{idcfi-i,Sb-2), v(t&c,f,-i,S6_2,2|,_j), 
u(Wc,b-i,Sb-2,z'j^_ym,b-i, jb-i) and y[b - 1] are jointly typical 
for some z^^ G C^v, and jb-i e [1,/]. 

• If Zj^^ = Zft-i, the decoder finds the correct for 
sufficiently large n if l llSt is true. 

• If z^ J z;,_i, the decoder finds the correct Wib-i for 
sufficiently large n if l ll6t is true. 

Applying Fourier-Motzkin Elimination (FME) to project 
out Ro and R from |[T3]l, l|T4t , l llSt and | [T6t , we get the desired 
result (8). □ 

C. Non-unique Decoding is Not Needed 

As we mentioned previously, the coding scheme of The- 
orem 12] shares elements with Cover and El Gamal original 
compress-and-forward [13, Theorem 7]; but differs from it 
mainly in two aspects. First, it uses backward decoding 
instead of the forward decoding of [13]; and, second, unlike 
[13] it does not require unique decoding of the compression 
indices. The second aspect is essential for getting the same rate 
expression as in l|8), with no additional constraints. However, 
as we will see shortly in the corollary that will follow, one 
can modify the coding scheme of Theorem |2] in a way to get 
the compression indices decoded uniquely and still get the 
capacity region, at the expense of slightly larger |V| and larger 
11X1. The key element is the observation that the constraint 
introduced by getting the compression index decoded, i.e., 

I{V; SIX2) - I{V; Y\X2) < ^Xr, Y), (17) 

or, equivalently, 

W, X2; Y) - W, X2; S) > 0, (18) 

is also implicit in the converse proof of Theorem[T] That is, the 
auxiliary random variables U and V of the converse proof of 
Theorem [1] satisfy l llSt . 

Theorem 3: The coding scheme of Theorem |2] can be 
modified in a way to get the compression index decoded. 
The resulting coding scheme is optimal and achieves an 
equivalent characterization of the capacity region of the 
model that we study given by the set of all rate pairs (Rc, Ri) 
such that 



Ri < I{U;Y\V,X2)-I(U;S\V,X2) 
Rc+Ri < I{U, V, X2; Y) - I{U, V, X2; S) (19) 

for some measure (S, U, V, Xi, X2, Y) e T and satisfying 

W, X2; Y) - W, X2; S) > 0, (20) 

where the auxiliary random variables V and \1 have their 
alphabets bounded as 

|V|<|S||Xi||J2l+2 (21a) 
|U|<(|S||Xi||X2l + 2)|S||Xi||l2|. (21b) 

Proof: The coding scheme that we use for the proof of 
Theorem |3] is very similar to that of Theorem |2l but with 
unique decoding of the compression indices. (See [11]). 
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